Emergence of partides from bosonic 
quantum field theory 
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An examination is made of the way in which partides emerge from 
linear, bosonic, massivc quantum ficld theorics. Two diffcrent con- 
structions of the one-particle subspace of such theories are given, 
both illustrating the importance of the interplay between the quant um- 
mcchanical linear structure and the classical one. Some comments 
are made on the Newton- Wigner representation of one-particle states, 
and on the relationship between the approach of this paper and those 
of Segal, and of Haag and Ruclle. 
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1 Introduction 

For better or for worse, most quantum systems are found by starting with 
a classical system and then quantizing it. The states of the resulting quantum 
system will be described by complex functions on the configuration spacc of the 
classical system, whose squared moduli teli us the probability density for finding 
the system in a given configuration. 

Applying this method to classical ficlds would secm, if not unproblcmatic, 
then at least difhcult only for technical reasons. Wc would naturally expect to 
find a theory whose states are wave-functionals on the configuration spacc: that 
is, maps which associate a complex number to cach configuration of the classical 
field on a given hypersurfacc. 

Scarcely a vestige of this behaviour is seen in the usual phenomenology of 
'quantum field theory'. Instead we find ourselves with a theory usually described 
in terms of partides: quarks, gluons, electrons. . . , and the localized interactions 
between them. The field-configuration viewpoint is occasionally seen (notably 
in the path-intcgral formalism of quantum ficld theory) but is usually regarded 
as at best a calculational tool. 
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Furthermore, these partides are notoriously strange entities. Various results 
of quantum field theory seem strongly to imply that they cannot be localized 
in any meaningful, covariant way; that they must be crcatcd and annihilatcd in 
interactions which cannot be spatio-tcmporally localized; that we cannot start 
with a theory of free partides and 'turn on' an interaction without pathology 
(implying that the concept of particle is bound up with the dynamics of the 
theory and is not just a kinematic concept); and that the partides which should 
be associated with a given field theory vary according to the energy levels at 
which that theory is studied. 

For these reasons, it is normal in modern quantum field theory to regard 
the field as the primary concept and the partides as secondary, derivative enti- 
ties. This process has been studied extensively using the methods of algebraic 
quantum field theory and the signs are encouraging that it can be understood 
in a mathematically and conceptually rigorous way; however, the very abstract- 
ness of these methods can make it difficult to understand quite why the idea of 
'particle' should be so powerful in understanding the prima facie very diffcrcnt 
concepts inherent in a quantised field. 

The purpose of this paper, then, is to analysc in a fairly concrete context 
the way in which certain subspaces of a quantum field theory's Hilbert space 
come to possess characteristics of a one-quantum-particle Hilbert space. 'The 
concrete context' in question is that of a massive, sealar, bosonic field, assumed 
to be asymptotically treatable as a free field. Section |^ presents the classical 
and quantum theory of such a field, and section || considers what the correct 
definition should be of 'local' and 'particle' states in QFT. In sections ||^| - 
which form the core of the paper — two separate constructions of the one- 
particle subspace are given, both of which illustrate the central role played by 
the interaction between the linear structure on the QFT Hilbert space (present 
in any quantum system) and the linear structure on the classical phase space 
(specific to a linear field theory) . Section ^| discusses the Newton- Wigner rep- 
resentation of position for one-particle systems, and section || makes some brief 
comments on the relationship between this paper's approach to QFT and some 
other approaches; section ^ is the conclusion. 

Currently, it is increasingly common for foundational discussions of QFT 
to be conducted in the powerful and abstract language of algebraic quantum 
field theory. This paper eschews that tendeney: some concepts and results of 
algebraic QFT are referred to, but the framework used here is much closer to that 
used by the mainstream physics community. For a defence of the validity of using 
this apparcntly rather non-rigorous framework for a foundational discussion, see 
Wallace (200"ïh| ). 

2 Field quantization 

In this section, we shall review the mcthod by which free, and weakly inter- 
acting, field thcories are quantized. We shall outline the problems which oceur 
when we try to reinterpret these quantized thcories as fundamcntally about 
partides, and then consider, in qualitative terms, how partides can enter the 
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theory in a non-fundamental way. 
2. 1 Classical free fields 

A fairly general second-order field cquation for a free-field theory is 
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^0(x,í)+^(x,í) =0 (1) 

where </>(x, í) is a real field defined on some manifold S x R, points on S are 
labelled by x, and 1Z is some real symmetric operator acting on functions of the 
spatial coordinate x. 

The simplcst examplc of such a theory is the real Klcin-Gordon cquation, 
for which 

Tl = m 2 -V 2 , (2) 

and in fact if we want a theory with relativistic covariance then there are no other 
examples. However, more general theories of this form are potential models for: 

• Curved spacetimes with a timc translation symmetry; 

• Systems interacting with a time-independcnt background field; 

• Solid-state systems. 

Most crucially for our purposes, field theories with this form also occur as ap- 
proximations to nonlinear theories; we will consider this case in more detail 
later. 

Such a theory can be generated from a Lagrangian 

m <i>] = \f d3x (<^ x ) 2 - ^)W)w) • (3) 

Carrying out the Legendre transform to the Hamiltonian formalism, we get 
a set of canonical coordinatcs <fi(x) labelled by x, a set of conjugate momenta 
7r(x), and a Hamiltonian H, where 

SL 



=# TTTT = fa ( 4 ) 



Hfo *\ = \J d 3 x (tt(x) 2 + 0(x)(^)(x)) . (5) 

Points in the phase space V of the field are then given by specifying pairs of 
functions (<fi, 7r); the Poisson bracket on V is given by 

r „ , r „ /" o (SA SB SA SB \ 

so of course <j> and ir obey the canonical relations {</>(x), 0(y)} = {""(x), 7r(y)} = 
Oand {0(x),7r(y)}=5(x-y). 

Through each point in phase space flows a unique trajectory; hence points 
in V are in one-to-one correspondence with solutions of (Q). 
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2.2 Field quantization 



Wc will quantizc classical ficlds (free or interacting) in the most naivc possi- 
ble way: by direct comparison with non-relativistic partició mcchanics. That is, 
we will represent states of the quantum system by complex wave-functions on 
the configuration space of the classical system. In tiris casc. that configuration 
space is the infinitc-dimcnsional space S of functions on S, so the quantum 
states will be functionals ^ [x] on this space (we will denote the Hilbert space of 
all such functionals as Hs). By analogy with the non-relativistic quantization 
of the coordinates q,p as 



qtp = qip(q); pip 



we will quantize the coordinates </>( x ) and 7r(x) as 

(0(x)*)[x] =x(x)*[x]; 



(7t(x)*)[ X ] 



It is easy to check that the canonical commutation rclations are satisficd: 



^( x )><A(y) = 7r(x),7r(y) 



0( x ),0(y) =^(x 



0: 



(7) 

(8) 
(9) 

(10) 
(11) 



It is to be admitted that we have been very cavalier with our treatment 
of the infinite-dimcnsional spaccs in usc herc. It is possible (whilst wc confinc 
ourselves to free fields) to be much more careful and rigorous,[] but if we wish our 
framework to be powerful cnough to handle intcractions then there is actually 
no need for infinite-dimcnsional technicalitics, for reasons to be cxplained in 



section 2.3 



2.3 Interactions and renormalisation 



Formally speaking, nothing in the previous description will be altered if we 
add some higher-order terms (such as c/> 4 ), which change the field equation from 
free to interacting: we could restrict our attention to regimes in which these 
terms are small in comparison to the free-field Hamiltonian, and proceed to 
analyse their effects using perturbation thcory. 

However, the reader may at this stage object that we are playing fast and 
loose with some very poorly-dcfincd mathcmatical concepts. In fact, it is well- 
known that terms like <fi 4 , when added to the Hamiltonian, give contributions 
which are not small, but infinite — hence formulating a well-defined interacting 



1 See Marsden and B atin (1QQ4 V Woodhouse (1991) for discussions of infinitc-dimcnsional 



classical mcchanics, and Wald (199^ ) for a careful discussion of quantising linear field thcorics. 
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quantum theory is actually very subtle. In fact, one approach would be to say 
that the only quantum theories we understand well enough for conceptual study 
arc the frce-ficld ones, and continc our attcntion to thosc. 

In this paper, however, we shall take a more liberal attitude. Therc is actu- 
ally a well-defined approach to understanding thesc apparent infinitics, workcd 
out primarily by Kenncth Wilson and originating in solid-statc physics. In Wil- 
son's approach, we postulate that QFTs do not after all have infinitely many 
degrees of freedom; rather, some unknown processes cut off the high-energy 
degrees of freedom and leave only finitely many to contribute to the physics. 
It thcn turns out — rather remarkably — that all interaction terms in the 
Hamiltonian will fall into two categories. Non-renormalisable interactions will 
be negligibly weak on energy scales far lower than the cutoff threshold. Renor- 
malisable interactions are not necessarily negligible, but at low energies they 
are affected by the choice of the cutoff only through modifications ( "renormali- 
sation") of the parameters in the interaction terms. Since these parameters are 
in any case only known through experiment, the choice of the cutoff bccomes 
irrclcvant to the low-energy regimes of the QFT. 

Solid-state physics provides an example of this process. If we study a solid- 
state system on length-scales which are large compared to the interatomic spac- 
ing, we can approximate the possible (classical) configurations of the àtoms by 
a continuous function — and thus approximate the system by a continuous field 
theory. In quantizing this theory we find that interaction terms lead to infinitics, 
but these are an artefact of our continuum assumption. Once we introduce a 
cutoff banning excitations of the system which vary significantly on length-scales 
short in comparison with the interatomic separation, the infinities vanish. 

Because we are understanding field theories in this way, we can take a re- 
laxed attitude to the infinite-dimcnsional spaccs which we will cncountcr in our 
analysis: such spaces are 'really' finitc-dimcnsional, with the very short-distance 
excitations disallowcd. As for the interaction terms, we will not have need of 
their specific forms. We shall just assume, where necessary, that such terms are 
present but that the theory has been renormalised and that, after renormali- 
sation, the interaction terms can be treated perturbatively. For details of the 
mathematics of this process, see Peskin and Schrocdcr (1995 ) or any other QFT 
textbook; for a conceptual discussion see IWallacc (2001b| ). 



2.4 Problems with a particle interpretation 



The theory constructcd abovc is undcniably a field theory, in the sense 
that its configuration space, and fundamental observables, are inherently field- 
theoretic. It is, however, tempting to try to reinterpret the theory so as to make 
direct contact with the particle concept, either by establishing some kind of 
'duality' between field and particle descriptions (in the same sense that there is 
a duality between position and momentum representations in ordinary quantum 
mechanics, with neither representation being privileged over the other) or by 
replacing the field description entirely with a particulate one (in which case, 
presumably, the field observables would just count as auxiliary constructions of 
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no direct physical significance) . 

Thcre are however, many problems which emerge as soon as we try to intèr- 
pret any QFT so as to incorporate partides at a fundamental level: 



The 'elementary partides' of particle physics arc generally undcrstood as 
pointlike objects, which would seem to imply the existence of position 
operators for such partides. However, if we add the requirement that 
such operators are covariant (so that, for instance, a particle localised at 
the origin in one Lorentz frame remains so localised in another) , or the re- 
quirement that the wave-functions of the partides do not sprcad out faster 
than light, then it can bc shown that no such position operators exist. (See 



Halvorson and Clifton (2001), and references therein, for details.) 



In non-relativistic quantum mechanics. it is straightforward to construct 
Hamiltonians which describe partides interacting via long-range forces 
(for a simple example, consider two charged partides interacting via a 
Coulomb force). However, the concept of a long-range interaction prima 
facie requires some sort of preferred reference frame, which seems to cast 
doubt upon the possibility of constructing such an interaction in a rela- 
tivistically covariant way. 



• As was mentioned in section 2.3, if interactions are present in a QFT thcn 
it is necessary to work, not with the bare parameters in the Hamiltonian, 
but with 'renormalised' parameters — and the parameters which must be 
renormalised include some of those, such as charge, which are generally 
taken to be intrinsic properties of partides. However, there is no priv- 
ileged way of renormalising the parameters, so that the vàlues of these 
parameters — and hence, the natures of the partides which they purport 
to describe — can be in part a purely conventional matter. 

• When we consider quantum field theory on a general spacetime back- 
ground, there is no unique procedure to define partides, and states which 
appear particulate in one reference frame do not do so in other refer- 
ence frames. For instance, consider the so-called 'Unruh effect', in which 
the Minkowski vacuum of a free QFT looks like a thermal (hence, non- 
particulate) state to a uniformly accelerating observer. In this example 
it may be possible to argue that non-inertial observers' descriptions are 
somchow less fundamental, but in a less symmetric spacetime there will 
be no preferred class of observers available, hence no preferred definition 



of particle. (For a more detailed account of this point, see Wald (1994 ), 



who advocates abandoning the particle concept as a consequence.) 

Not all have abandoned partides as fundamental in view of these difficul- 
ties: Fleming has given a strong defence of the idea that particle localisation 
does indeed make sense in relativistic QFT (see Fleming (1996), Fleming anc{ 



Butterfield (1999), and references therein) and Weinberg's recent QFT text- 



book (Weinberg 1995) cxplicitly begins with partides and construets the fields 



as auxiliary objects. However, the general consensus in QFT (insofar as such 
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issues are ever explicitly addressed^) appears to be that the subject is primarily 
about quantum fields. In fact, much modern research in the field only really 
makes sense from this vicwpoint: for example, consider lattice quantum chro- 
modynamics (which attcmpts to understand quark confincmcnt and the exis- 
tence of protons and neutrons, but is formulated in terms of field configurations 
and makes only limited contact with the elementary heuristic that a proton is 
'just' three partic ulatc quarks bo und togcthcr); or consider the quantum Sinc- 



Gordon equation ( Colcman 1985 ), which has two distinct particle descriptions 
(one fermionic, one bosonic) with the weak-field version of the one equivalent 
to the strong- field version of the other). 

Of course, none of this is to deny that partides exist, merely that they are 
not part of the fundamental ontology of quantum field theory. In the next 
section we will consider how it might be possible for the particle concept to be 
recovered from a field-theoretic description. 

2.5 The particle as emergent concept 

It is a central result of condensed-mattcr physics that. if we start with somc 
macroscopic collection of nonrelativistic partides close to some collective stable 
state, small excitations from that state can often be treated in terms of creating 
'partides'. It is also generally true that, for strongly interacting systems, these 
'partides' do not coincide with the partides from which the system is built: so 
vibrations in a crystal are described in terms of 'phonons', which are not crystal 
àtoms, and quantized waves in a magnet are described in terms of 'magnons' 



which are not iron àtoms (Kittcl and Fong 1987) 



There are striking formal parallels with quantum field theory: in fact, the 
construction of phonons from a monatomic crystal is virtually the same as the 
construction of particle statcs in a massless, scalar quantum field theory. The 
diffcrcncc is, the ontology of a crystal is not in question. It is definitely made up 
of the lattice àtoms - which correspond to the field states at different space points 
in scalar QFT. Nonetheless many phenomena can be described by regarding the 
crystal as a gas of phonons, and some — e. g. heat transport — require us to 



think in terms of localized phonons ( Kittcl 1996 ) 



There is nothing particularly paradoxical about this: the crystal isn't 'really' 
a gas of phonons, it's just that certain states of the crystal have properties very 
similar to such a gas, and that treating these states as such is a great boon to 
analysis of crystal dynamics. This puts phonons and their ilk in good company, 
for a great many objeets in seience — such as animals, or rigid bodies — have 
to be understood in the same way. There are no perfectly rigid bodies, for in- 
stance (and they are certainly not part of the bàsic ontology of any fundamental 
physical theory), yet certain states of a many-particle system approximate the 
behaviour of 'ideal' rigid bodies extremely well, and so deserve the name. (See 



Wallace (2001a) for a more detailed discussion of this point.) 



We shall adopt the same attitude to the partides of relativistic quantum 



Sec (Wilczck 1999) for an explícit statement of this consensus. 
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field theory: that is, we shall look for subspaces of the QFT Hilbert space in 
which the states have particulate properties. This will require us to formulatc 
a dcfinition of 'particle' and then to show that there are states of the QFT 
which approximately satisfy that dcfinition; the rest of the paper is concerned 
with this task. First, though, we need to consider in which situations we would 
expcct a QFT to appear particulate. 



2.6 Particle regimes 



The phenomcnology of quantum field theory suggests two regimes in which 
we expect particle behaviour: 

• The non-relativistic limit, in which the QFT appcars to be dcscribcd 
by slow-moving partides interacting by long-range forces; 

• The scattering limit, in which partides begin widely separated, interact 
by short-range forces, and at late times are again found in widely separated 
states. 

We shall be concerned almost exclusively with the second case, for reasons of 
mathematical tractability rather than on conceptual grounds: the analysis of 
relativistic fields via the methods of scattering theory is fairly well understood, 
whereas the process by which nonrelativistic quantum mechanics emerges as 
a limiting case of QFT is much more complicated. In the case of scattering 
theory, though, at times sumciently long after (or before) the scattering event, 
the theory becomes very well approximated by a free quantum field theory. 
(This is intuitively plausible since for scattering theory to be applicable in the 
first place it is necessary that the nonlinear terms in the Hamiltonian constitute, 
after renormalisation, only a small perturbation to the free-ficld theory; for a 
much more careful discussion and justification, sec Haag 1996| .) 



For this reason, our analysis henceforth will be restricted to free quantum 
theories (more specifically, to quantum theories of the form (Q); this includes 
some sorts of background-field interactions). 

3 Defining partides 

In this section, we shall work out a dcfinition of what properties a family of 
QFT states ought to have in order to count as 'particle' states. Since the idea 
of 'particle' is plainly at least connected to the concept of a localised state, we 
begin by considering how the latter states are to be defined in QFT. 



3.1 Localised states in a field ontology 



Which ficld-theory states are to count as localised? 

In a QFT the idea of localisation must enter through the spatial localisation 
of the observables. The observables of the theory are defined via the field oper- 
ators </>(x, t) and 7r(x,í), so it is natural to define any given observable at time 



t as being localiscd in a spatial region Ej Ç £ ifï it is a function only of field 
operators of form 0(xj, t) and 7r(Xj, t) with all of the Xj in Ej. 

But if defining localised observables is straightforward, defining localised 
states will prové dccidcdly less so. We might begin by trying: 

Naive localisation: A state 1^) is localised in a spatial region Ei 
iff (0| O \ <f>) = for any observable O localised outside Ej. 

This sccms plausible when we compare it with the classical case: there a state 
is localiscd in Ej if 7r(x) = 4>(x) = for any x ^ Ej. But it is mathcmatically 
impossible for any state to satisfy it, for it implics that for any such x, and for 
any n G Z + , 

M$ n (x>*)M = (^|r l (x,í)|^) =0. (12) 

But this would imply that \ip) was a simultaneous eigenstate of 7?(x) and 0(x), 
and these operators have no eigenstates in common. (The mathcmatics, bar 
some need to regularise to deal with operators defined at a point, is the same as 
for the nonrelativistic operators X, P, which are well-known to have no eigen- 
states in common.) 

Physically it is easy to see what is happening here. The vacuum state of a 
field thcory (which wc will dcnotc by |íí)) is not 'nothingncss', or 'empty space'; 
it is simply a slightly colourful way of dcscribing the ground state of the field's 
Hamiltonian. In solid-state systems (which, recali, we are treating as field- 
theoretic systems like any other) this state is just the zero-temperature state of 
the sòlid, in which the àtoms will not be at rest but will have zero-temperature 
fluctuations; the same will be true for the field excitations of a rclativistic field 
theory. 

This suggests, however, an alternative definition, first proposcd (for space- 



time regions 0, not spatial regions Ej) by Knight (1961) 



Knight localisation: a state is localised in a sjpacetime region O 
iff (tfi\ A \(f>) — (fi| A |fi) =0 for any observable A localised outside 
the light cone of O. 



It is possible to find states satisfying this criterion (Knight 1961): take any 
unitary operator U localised in O, then the state U |íï) will be Knight-localiscd 
in O. 

However, Knight localisation diffcrs in onc important respect from the sort 
of localisation which we encounter in NRQM. In the latter, properties like 'is 
localised in C are treatable in the same way as properties like 'has energy £" 
or 'has momentum less than p': that is, we can define a projection operator 
whose intended interpretation is 'localised in C, whose range is the space of all 
such states. This would be possible for Knight-localised states iff they form a 
subspace: that is, iff any superposition of two states Knight-localised in O is 
also Knight-localised in O. 

The fact that Knight-localised states do not have this property is a conse- 



quence of the Rcch-Schlicdcr thcorcm (Rech and Schlicder 1961) 
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Reeh-Schlieder theorem: for any region O, the set of all states 
generated by the action of operators localised within O upon the 
vacuum, spans the Hilbcrt spacc of the QFT. 



(For a proof, and further discussion, see Haag 199É.) It follows^ from the Reeh- 



Schlieder theorem that states Knight-localised at O span the entire state space, 
which rules out any possibility of a projector mcaning 'localised with certainty 
in 0\ 

It is easy to see — again by analogy to the sòlid state — why these problcms 
oceur. For in a genèric solid-state system, àtoms are couplcd to their ncighbours, 
and as a consequence the ground state of the system is highly entangled. This 
allows us (in principle) to exploit the long-range correlations between spatially 
separated subsystems of the field to produce any state by local operations within 
O. 

(To see this process in a far simpler system, consider the four- dimensional 
Hilbert space TÍa ® Hb, where Ha and Hb are each one-qubit (two-state) 
systems. The entangled states 

|£b> = -L(|l>®|l>±|0>®|0>) (13) 

are totally indistinguishable from one another when restricted to either sub- 
system (they both induce the reduced state p = |(|0) (0| + |1) (1|) on each 
subsystem) but their sum -l=(|</>+) + \(f>—) = is clearly distinguishable 

from both of them on either subsystem. Examples of this kind are analysed in 



rather more detail by Redhead (1995) and Clifton and Halvorson (2001).) 

Howcvcr, in practice the correlations due to vacuum entanglcmcnt usually 
drop off fast enough that using Knight-localised states to approximate states 
localised far from O requircs prohibitivcly high-energy states. We can then use 
the following pragmàtic criteria to characterise locality: 

1. Effective localisation (qualitative form): A state \ip) is 
effectively localised in a spatial region Sj iff for any function / of 
field operators 0,vf, (ip\ f \ip) — (0| / |Í2) is ncgligibly small when / 
is evaluated for field operators outside S,, compared to its vàlues 
when evaluated for field operators within Ej. 

2. The effective localisation principle (ELP) (qualitative 
form: A subspace Ti of the QFT Hilbert space Tís obeys the ELP 
on scale L iff for any spatial region S large compared with L, a 
superposition of states effectively localised in S is effectively localised 
in effectively the same region. 



3 To see that it follows, we need only note that the unitary clements of a (boundcd) operator 
àlgebra A(0) span .4(0). This can be proved as follows: for any boundcd Hermitian element 
H of A(0), and any t ^ 0, (ií) _1 (cxp(ií/í) — 1) is a linear combination of unitary elements of 
A(0). As í — » 0, this sequence tends to H, hence H is in the span of the unitary operators. 
To complete the proof, simply recali that any linear operator can be written as A + iB, where 
A and B are Hermitian. 
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These qualitative notions can be made precise in a number of ways, such as: 



1. Effective Localisation (quantitative form): A state is L- 
localised in a region Ej, iff for any function / of field operators 
4>,tt, (tp\f\ip) — f \íï) falls off for large d like (or faster than) 
exp(— <i/L), where d is the distance from E^ at which the function 

(/) is evaluated. (Note that there is no diffcrcnce, according to this 
dcfinition, bctwccn a state L-localised at some spatial point x and 
a state L- localiscd in a region of size ~ L around x.) 

2. ELP (quantitative form): A state obeys the ELP on scale 
L iff, for any 3-spherc S of radius > L, a superposition of states 
L-localised in S is i-localised in S. 

A subspace of states for which ELP holds on scale L can be treated - 
approximately — as possessing a well-defined concept of localisation and of 
"localiscd in Ej" projectors for regions large compared with L (these are con- 
structed, for each such region Ej, by taking the projector onto the set of all 
states in Tí which are effectively localiscd in Ej; because of ELP, this set must 
be a linear space). Effectively, in such a subspace we are excluding enough states 
that for any sufhciently large Ej, we cannot construct states localised far from 
Ej using only those states localised within Ej. 

It is still reasonable to ask: what good is effective locality? A state effec- 
tively localised in A can still in principle be distinguished from the vacuum 
via measurements made arbitrarily far away from A. This question lies rather 
outside the scope of this paper (see Halvorson and Clifton (200l[ ) and Wallace 



(2001b| ) for further discussion). Here we note only that such problems are by 



no mcans new to rclativistic quantum thcory. Evcn in non-relativistic quantum 
mechanics, there are in general no states which remain exactly localised in a 
finite region for any finite period of time — yet this does not seem to get in the 
way of the concept of localised particle in NRQM. 

For the purpose of this paper, we shall treat effective localisation as 'good 
enough', and (since no particularly useful concept of exact localisation exists) 
will often drop the word 'effective', treating effectively localised states simply 
as localised. 



3.2 What is a quantum particle? 



Granted that a quantum field thcory must be treated as being fundamcntally 
about fields, what properties must a given state of a quantum field theory have 
in order to be deemed a particle state? It is instructive to start by considering 
the classical case: which classical field configurations (if any) could be described 
as partides? Here the answer seems obvious: the 'particle' configurations will 
be field configurations which are localised in a fairly small spatial region - 
localized blobs of field, in fact. Translated into quantum mechanics, this would 
make 'partides' just another name for the effectively localised states of the last 
section, provided that they were localiscd to sufficiently small regions. 
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However, this classical concept of particle is in one sense too weak to be 
appropriate for quantum theory. Classical wave-packets tend to spread out 
with time, becoming less localised — and hence, less 'particulate', whereas in 
non-relativistic quantum mechanics a state describing n partides at time t will 
continue to describe n partides at all other times — and even in relativistic 
quantum mechanics we wish to recover a notion of particle which is robust and 
time-independent provided the partides are far away from one another.^] 

Furthermore, the criterion that partides should be localised is in some sense 
also too strong for quantum mechanics. As the two-slit experiment reminds us, 
it is easy for a particle to enter a state which is nowhere near an eigenstate of 
position — in other words, nowhere near localised. 

However, the two-slit experiment also suggests the correct quantum defini- 
tion of particle. Although the experiment shows — by demonstrating interfer- 
ence of the particle wave — that a classical-particle picture isn't viable, it also 
shows that a classical-wave picture isn't viable either, because on measurement 
the particle is always found to be localised somewhere. To ensure within the 
formalism of quantum physics that this happens, it is enough to require the 
particle to be a linear superposition of states all of which are localised — then 
any measurement of particle position will always give a single answer. (I stress 
that this is intended to be an essentially interpretation-independent statement: 
I am not addressing the measurement problem here.) 

These observations motivate our definition of a quantum particle:^] 

A space of one-particle states of size L (where L is small), written Hip, is a 
subspace of the QFT Hilbert space Tís such that 

1. There is a basis for Hip, each member of which is a state i-localised at 
a point; cquivalently, all states in Tíip are linear superpositions of such 
localised states. 

2. 7i\p satisfies the effective localisation principle on scale L. 

3. Tíip is effectively preserved, on relevant timescales, by the dynamics of 
the field theory. 

This definition is intcntionally somcwhat vague. The imprecision of the third 
criterion mirrors the way in which quasi-particles arisc in solid-statc physics — 
often the quasi-particles spontancously decay, so that the one-particle subspace 
is not exactly preserved by the dynamics. However, provided that the decay 
time is long compared to other relevant timescales (such as the time taken 
by the quasi-particles to move between col·lisions) then the quasi-particles will 



4 We can find classical field theories 



hi eh conta in st ates likc thesc — the solitons of the 



sine-Gordon equation are one example (Colcman 1985) — but in general they oceur only 



in strongly non-lincar theories, whereas here we are concerned with linear or ncarly linear 
theories. 

5 It should be noted that this definition is closely related to the definition used in algebraic 
QFT, in which an n-particle state Ja deflncd as one which is able to trigger up to, but no more 
than, n detectors at a time. See 
definition. 



Haag (1996, section II. 4 and chapter VI) for more on this 
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provide a useful concept with which to describe the field theory. As the decay 
time decreases there will come a point at which this concept ceases to be useful, 
but it would be a mistake to try to define this point exactly. 

Wc have also made no attempt to be precise about the phrase 'where L is 
small': how small is small? In non-relativistic quantum mechanics, the answer 
is 'arbitrarily small': a (possibly overcomplete) basis can be constructed from 
states cffcctivcly localised in arbitrarily small regions of configuration space. 
(The set of all Gaussians of an arbitrary fixed width, for instance, will do nicely.) 
It will turn out, however, that this is not possible in quantum field theory: here 
there will turn out to be a minimum realizable size. It is reasonable to think of 
this as giving the 'size' of a particle: a particle's size is the size of the smallest 
region in which it can be localised. 

Is it justifiable to be this vague in our definitions? A robust answer would 
be 'it works for quasi-particles, so why not?' More satisfactorily, we can recali 
that we are not looking for partides which can be added to the bàsic ontology 
of our theory (which, granted, does need precise definition); the bàsic ontology 
is and remains states of Ws, or cquivalently, wave-functionals on S. Rathcr, 
we are just finding a good way to characterise certain states with interesting 
properties. Provided these states are picked out very accurately, there is no 
need to worry if the aceuracy isn't perfect: we are simply looking for aceurate, 
robust schemes by which we can approximate the dynamics of the theory and 
explain phenomena. (For a more extended, and somewhat more philosophical, 
defence of this use of approximate concepts in physics, see Wallace 20014 ) 

In any case, it is the existence of an Tíip simultaneously satisfying (1), (2) 
and (3) which is in need of explanation. A space satisfying any given one of 
these clauses would not be particularly remarkable: for instance, given any 
collection of localised states we could construct a space satisfying (1) by taking 
their span, but then this space would not generally be preserved under time- 
evolution; or we could construct a space satisfying (3) by taking the collection 
of all states which are time-evolutes of our given collection, but then generally 
not all such states would be linear superpositions of members of the original 
collection. Furthcrmore, if our system satisfied (1) and (3) but not (2), we would 
have no guarantee that the concept of localisation would work for our partides 
as we need it to do in non-relativistic quantum mechanics and in scattering 
theory: specifically, we would have no guarantee of the existence of projections 
onto partides in a specific location. 

In the next three sections, we will go about constructing states which fit the 
definition of a particle given above. Bcforc embarking on this task, though, we 
should address an obvious objection: that we know perfectly well which states 
of a free QFT are the one-particle states, so all that is left to do is verify that 
the definition holds for these states. 

The results of the ensuing calculation would, of course, confirm that free 
QFTs have one-particle sectors; however, it would not really answer the question 
of why they do. The more indirect approach used here is intended to give some 
insight into this second question. 
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4 Modal analysis of a free field 

This section is a mathematical analysis of the structure of classical linear 
field theories; it is a common 'building block' for the two methods of reaching 
the one-particle subspace which will be developed in sections [| and ||. 

For the sake of mathematical rigour, this section makes some use of distri- 
bution theory (all such material can safely be skipped by any reader who does 
not get nervous upon sighting a Dir ac delta func tion). The notation and ter- 



minology used is essentially that of Rudin (1991 ), especially chapters 6-7; in 



particular, use is made of Rudin's elegant 'multi-index' notation, in which 

• an index a stands for an ordered rt-tuple (ai, . . . , a n ) with G Z + : 

-D* ■= (afr) Ql ···(^r; 

• \a\ := ai H h a n . 

4-1 Required properties of 1Z 

Recali that the free-field theories we are considering have the field equation 
©,i.e. 

d 2 

^(x,t)+7ty(x,i) =0. 

We begin our analysis with a technical digression onto the operator 1Z in this 
field equation. Specifically we will require the operator to have the following 
properties: 

1 . 1Z is a continuous linear map from C°° (S) , the space of real smooth func- 
tions on S, to itsclf.[] 

2. 1Z can be extended to a self-adjoint operator on (a dense subspace of) 
the space L 2 (E) of square-integrable complex functions on S. (We shall 
identify 1Z with its self-adjoint cxtension). 

3. 1Z is a local operator, in the sense that lZf(x.) depends only on the vàlues 
of / in an arbitrarily small neighbourhood of x.Q 

4. The spectrum of 1Z is known to be real, since it is self-adjoint; we shall also 
require it to be positive and to be bounded bclow by a strictly positivc 
cigcnvalue. (In other words, zero is not an eigenvaluc of 1Z\ hence, 1Z is 
invertible) . 



^'Continuous' means 'continuous with respect to the topology nn <7°°(yi) indu ced by the 



family of semi-norms Pjv(/) = sup{|D a /(a;)| : x £ S, |a| < A^}'; see Rudin (1991, pp. 34—36) 

for more on such topologies. 

7 Given the short-distance cutoff introduced in section \l. ] to make mathematical sense of 
interacting QFTs, the requirement of exact locality is not really necessary: it is enough to 
require that Tíf(x) depends significantly on the vàlues of / only in a neighbourhood of width 
~ _Lcut> where L cu ^ is the cutoff lengthscale; anticipating the later results of this section, this 
is to require that TZ is L cu t-local. 
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If the spectrum of 1Z is discrete, 1Z must have a complete set of eigenfunc- 
tions, orthonormal in the L 2 inner-product 



(4>,r/>) = / d 3 x^(x)^(x); (14) 



E 

we will denote a given such set as /fc(x). (Note that since 1Z is both real and 
self-adjoint we can always choose its eigenfunctions to be all real, though we 
shall not always do so.) The eigenvalue of fk is denoted w 2 ., with ujk > 0. 

We will, in fact, take a somewhat schizophrenic attitudc towards the dis- 
creteness (or otherwisc) of the spectrum of 1Z: for conceptual analysis it will 
usually be convenient to take it as discrete, but in practical applications we 
will often want to take 1Z to be a differcntial operator on R 3 , in which case the 
spectrum is necessarily continuous. We shall thereforc take the usual (if some- 
what non-rigorous) physicist's step of assuming that moving from a discrete to 
a continuous spectrum is a purely technical matter involving no change in the 
conceptual situation. 

Now, let x be any point on S; then we can define a linear functional 7Z X on 
C°°(£) by 7Z X ■ / = (7Zf)(x); the continuity of 1Z means that 1Z X is continuous, 
hence is a distribution (generalised function) over C°°(E). The following results 
are easy consequences of distribution theory and of the locality of 7Z: 

1. Because 1Z is local, each 1Z X has support {x}. 



2. From thcorem 6.25 of ( Ftudin 1991 ) we can deduce that (in a local chart 
at x), we can find constants c a and N such that TZ X = X)| q |<tv c aD a 6 x , 
where S x is a Dirac delta at x. 

3. From the continuity of TZ, it follows that, in any local chart, we can find 
functions c a (x) such that, for any x in the chart, 7Z X = 5Z| a |<jv c a( x )^ a ^x- 

4. From this, we deduce the (fairly obvious) fact that 1Z is a differcntial 
operator. 

If we follow the usual fiction of treating distributions as functions, we can 
(formally) define a function 7?.(x, y) by 

jíd 3 y^(x > y)/(y)=Wx·/; (15) 

hence 

(W/)(x)= / d 3 y^(x,y)/( y ). (16) 
Js 

Again formally, wc can think of this function as giving the matrix elements of 
1Z in a position basis, provided we remember that these clements are derivatives 
of delta functions. 

It follows from the spectral theorem that 

^(x,y)=]T^/ fc (x)A*(v), (17) 
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and that the kernels 72 A (x, y) of the operators 72 A are given by 



72 A (x,y)=^f/ fc (x)/*(y). (18) 



(Again, these kernels may well be delta-functions or othcr such distributions; 
thcy are not necessarily well-behaved functions.) 

Fractional powcrs of 72 will become important later in the paper, and in gen- 
eral such operators will not be exactly local even if 72 is (the operator y/m 2 — V 2 , 



for instancc, is known (Goodman and Scgal 1965) to be anti-local, in the sense 
that for any function /, supp\/?n 2 — V 2 /Usupp/ is all of space except possibly 
for a set of points of measure zero) but they may be 'approximatcly local'. Wc 
dcfmc 'approximatcly local' as follows: 

An operator 72 is L-local iff its kernel 72(x, y) drops off like cxp(— |x — y\/L) 
as |x — y| becomes large compared with L . 

Informally, this means that while 72./ (x) does not just depend on the vàlues of 
/ in an arbitrarily small ncighbourhood of x, it does depend significantly on 
the vàlues of / only in a neighbourhood of width ~ L. Note that there is a 
certain looseness in the defmition (in the phrase 'large compared with'); purely 
mathematically, we could replace this with 'as |x — y| — > oo' but clcarly it would 
be against the spirit of the definit ion for (say) the kernel to start dropping off 
only once |x - y| > 10 30 L. 

In the next section wc will prove approximate locality for an important 
subclass of 72 operators. 



4-2 Euclidean-invariant 72 



In this section we will consider an important sub-class of 72 operators: those 
which act upon R 3 and which are invariant under spatial translations and ro- 
tations. In this context we can establish the approximate locality of the 72 A 
operators. 

The reason for requiring translation invariance is that we can work in Fourier 
space: any translation-invariant operator must have the exponential functions 
(2^)3/2 exp(ik • x) as its eigenfunctions and so from ( |Ï7| ) we must have 

72(x, y) = J d 3 k exp(zk • (x - y) V 2 (k). (19) 

Since 72(x, y) is a rotationally-invariant sum of derivatives of delta functions, 
it follows that the function w 2 (k) is a polynomial in k • k. Formally, then, the 
integral ( |Ï9|) can be transformed to 

72(r) = ; \ [ dkkoj 2X exp(ikr) (20) 
J R 

where r = |x — y| and where we have replaced 72(x, y) with 72(r) to indicatc 
that 72 depends on x and y only through r. For positive A at least, this integral 
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is divergent, indicating that TZ{r) is distributional; however, if the spectrum is 
unbounded then for sufficiently negative À then the integral becomes convergent. 

Now, general powers of polynomials are holomorphic except along branch 
cuts from the zeroes of the polynomial (for details, sce any introductory complex 
analysis textbook; Priestlcy (1989| ) gives a clear exposition); furthermore, by 
assumption (4) of section 4.1 wc know that the function ui 2 (k) has no real 
zeroes. We are then able to evaluate (20) by contour integration, as follows 



(this is a simple gene ralisation of Standard methods in QFT; see, e. g. , Pcskin 



and Schroeder (1995 ) for a presentation of the method for 1Z = m 2 — V 2 ): 
we construct branch points from each zero in the positive half-plane upwards 
paral·lel to the imaginary axis, and from each zero in the negative half-plane 
downwards paral·lel to the imaginary axis. We then deform the contour upwards 
towards -Moo, wrapping it around each branch cut in the upper half-plane in 
the process. The integral is then a sum of integrals along both sides of each 
branch cut in the upper half-plane. 



Branch cut > 
Zero of u)' 2 



Contour of 



Deform contour 
upwards 



integration 



Now, suppose that there are TV such branch cuts, starting at points (uí,vï) 
in the upper half-plane. Each integral along such a cut takes the form 

exp( — VíV^) /*°° 

It = \ n s 7 7 / dp(ui + i(vi+ p))fi(p)exp(iv,ir)exp(-pr), (21) 



(27T) 

where fi(p) = lim e _ to ([j 2A (w.j + e, Vi + p) - cü 2X {uí — e, Vi + p)). As r — > oo (or, 
more specifically, as r grows large compared to each 1/ui) the sum of all such 
terms will be dominated by Ii Q1 where Vío is the smallest of the Ví; in turn, for 
large r Ii will become proportional to exp(— w Jo r)/r. 

This cstablishcs that for sufficiently negative A, 1Z X is a l/ui -local operator 



in the sense of section |4.1| . Since any 1Z X can be written as 7Z x ~ n 7Z n for arbi- 
trarily large n, and since each lZ n is strictly local, it follows that all powers of 
1Z are (at least) l/t^-local. 

In the case of Klcin-Gordon theory, where 1Z — m 2 — V 2 , the only zero of 
the spectrum in the upper half planc is at (0, m), hence powers of 72. arc 1/m- 
local in this case. 1/m is generally referred to as the Compton wavelength; we 
will extend this term to all translation-invariant 1Z, and define the Compton 
wavelength L c of such an 1Z as equal to v; . 



Given these results, in addition to axioms (1-4) of section 4.1, we will require 
1Z to satisfy one of the following: 

5a. For all A, 7Z is L r -local for some L r \ or 
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5b. 1Z is rotationally and translationally invariant. 

Of course, 5b implies 5a. 

It might appear that solid-state systems do not satisfy 5b since the lattice 
structure violates translational and rotational invariance, but in fact the lattice 
only enters the observable results of the theory by imposing a short-distance 
cutoff, and hence (provided we work at lengthscales large compared with the 
cutoff) most solid-state systems may be treated as satisfying 5b. 

4-3 Modes of the free field 

Recali how to solve the free-field equation (Q) by separation of variables: we 
try an ansatz of form ^(x, t) = A(x)B(t); this gives 

A(x)B(t) + B(t)TZA(x) = 0. (22) 

Dividing through by A(x)B(t) splits the equation into two terms, one indepen- 
dent of x and the othcr of i; this means that the equation can be solved only 
by finding solutions to the paired equations 

B + lu 2 B = 0; (23) 

1ZA = ív 2 A (24) 

where u> is to be determined. The second of these is simply the eigenfunction 
equation for 1Z. Each mode will have either exponential decay/growth (for 
uj 2 < 0), or sinusoidal variation (for lj 2 > 0), in time; our restriction to positivc 
1Z eliminates the former case (this is the reason for this restriction) and we are 
left with a set of solutions of the form 

0(x,í) = / fe (x)cosKí) (25) 

and 

4>(x,t) = / fe (x)sin(cJ /c í). (26) 

(For the Klcin-Gordon equation, the /k are just proportional to sine and cosine 
functions sin(k ■ x), cos(k ■ x), with the possible vàlues of k constrained by the 
boundary conditions and with u 2 = m 2 + k • k.) 

An arbitrary solution of the equations can be expressed as a sum of solutions 
of this form: 

0(x,í) = V* — — (çfc/fc(x) cos(wfcí) +p k f k (x) sin(wfcí)) , (27) 

— * . /t,1i- 



/Lük 

so that a solution is given by the collection of real numbers (qk, Pk)- 

Since the spacc of solutions to the field equations is in one-to-one correspon- 
dence with the phase-space V (via 0(x) = (j>(x, 0), 7r(x) = </>(x, 0)) we can regard 
(lk,Pk) as coordinatizing V: to be specific, we have 



Y,4=Qkfk(x) (28) 
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and 

TT(x)=J2V^Pkfk(x). (29) 
k 

In fact, the choice of \fü k factors in ( p7| ) means that they are canonical 
coordinatcs, in the sense that they obey the Poisson-bracket relations {q kl q k >} = 
{PkiPk 1 } = 0; {qk,Pk'} = Sk,k' (the proofs are straightforward and make use of 
the orthonormality of the f k ). In these coordinates the Hamiltonian (||) becomes 

R=^J2^(pl+ql). (30) 

k 



Thus, subject to our restrictions on 1Z at the start of section 4.1, any free-field 
theory can (as is of course well-known) be expressed as a sum of independent 
harmònic oscillators. 

If we define a k = -^(Qk + íPk) we carL rcwrite (|27j ) in the alternative form 

= V — == (a k f k (x)cxp(iuj k t) + a* k fk{-x)exp(-iu) k t)) . (31) 

In fact, there is no real reason to restrict to real-valucd cigcnfunctions f k : the 
expansion ( |3l·| ) is just as vàlid for complex cigcnfunctions. Inverting it gives 

a ^ = ^J s d 3 x fl (x) (y/UïMx) - *^^( x )) > ( 32 ) 

which (following the definition a k = ^(Qk + ipk) in the real-/fc case) suggests 
taking 

q k = d 3 x (^V^0(x)Re/ fc (x) + -L=^(x)Im/ fc (x)^ (33) 

and 

Pk = £ d 3 x (^-V^^(x)Im/ fc (x) + -^7r(x)Re/ fc (x)^ (34) 

in the complex case. We can readily show that these are still canonical coor- 
dinates; the forms of (^7j-|29|) become slightly morè complicated but H still has 
the form (|3C|). This generalisation is useful in the Klein-Gordon equation, for 
instance, as it allows us to take /k cx exp(zk-x), which is usually mathcmatically 
more convenient than working with sine and cosine functions. 

The coordinate functions q k and p k have very simple time-dcpcndencc: from 
the Poisson brackets, we have q' k = u k p k and p k = —uj k q k . Hence the time- 
cvolution of the system is 

qk (*) = Çfc(0) cos(w fc í) + p k (0) sin(w fe í) ; (35) 

Pk(t) = pfc(0) cos(wfcí) - q k (0) sin(cjfet), (36) 

or equivalcntly 

a k (t) = a k (0) exp(-iw fe í). (37) 
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5 Partides through coherent states 

The first method which we will use to construct particle states begins by 
constructing quantum-mechanical approximations to classically localised field 
states. In fact, it will turn out that these approximations are not partides, but 
they provide a natural stepping stone towards partides. 



5.1 Harmònic oscillator coherent states 



For a linear field theory, note that evcn in thc classical casc there are states 
which are localized to greater or lesser degrees. For instance, plane waves are 
(improper) classical states which are not localized at all, whereas we can con- 
struct fairly localized wave-packets. This is diffcrent from the case of particle 
quantum mechanics, where the classical states are perfectly localized and any 
loss of localization oceurs only at the quantum level. 

One way to construct a localized quantum state might be as follows: we begin 
by choosing a point in the classical phase space which is fairly localized (e. g. a 
fairly compact classical wave-packet) and then try to construct a quantum state 
which is concentrated around this point. Obviously, in the general case there is 
no unique way of approximating a phase-space point in quantum mechanics since 
precise localization in phase-space is not a wcll-dcfined quantum concept. In the 



case of a harmònic oscillator. however, there is a simple prcscription(Glaubcr 
|(1963| ); sec [Peres (1993| ) for a discussion) which gencrates approximations to 



phase-space points. If a is the creation operator for such an oscillator and |0) 
is its ground state, then the state 

\a) = cxp(-|a| 2 /2)cxp(aS t ) |0) (38) 

has the following properties: 

1. It is a Gaussian in both position and momentum space; 

2. It is centred around q = \/2ïlca in position space, and p = \/2lma in 
momentum space; 

3. In both position and momentum space, the wave packet keeps its shapc 
under time evolution (i. e. remains a Gaussian of the same width); 

4. As time passes, the centres of the Gaussian in position and momentum 
space evolve as would the position and momentum of a classical harmònic 
oscillator with the same Hamiltonian. 

Such a state is called a coherent state. Note that because of the one-to-one 
correspondence between phase space and the set of solutions to the dynamical 
equations, and because the coherent states track the classical evolution of phase 
points, we may equally well regard coherent states as quantum approximations 
to classical solutions. 
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5.2 Field coherent states 



Since the free field is (mathematically speaking) a collection of independent 
harmònic oscillators, these coherent states are an appropriate tool to construct 
quasi-classical states. The kth mode has creation operator a k (= — Wk)), 

and hence a basis for the field Hilbcrt space is given by the states created 
by successive actions of the different a\, on the vacuum. 

A state localized around the fcth mode would be 

D k (a) |0) := exp(-|a| 2 /2)cxp(aa[,) l n ) ( 39 ) 

where the classical mode being approximated is (Re a/fc/y/iüJfc, Ima/tyïi). 
Similarly, a classical state made from a superposition of modes may be quantum- 
mechanically approximated by successive application of D k operators to the 
vacuum, and the evolution of the classical state will be tracked by the quantum 
wave-packet. 

It is vital to keep in mind the diffcrences between classical and quantum 
concepts in what we are doing. Remember, we are constructing a quantum 
wave-packet, a complex functional on the space of field configurations, which 
is concentrated around a given point in configuration space. That point itself 
describes a classical wave-packet, that is, a real function on physical, three- 
dimcnsional space. 

As time passes, the quantum wave-packet will not spread out across con- 
figuration space but will move around keeping its shape. Its centre will move 
through the configuration space according to the classical equations of motion, 
which will cntail the spreading out through physical space of the classical wave- 
packet. Thus a coherent state will become less localized in physical space with 
time even though the quantum wave-functional keeps its shape (and, in partic- 
ular, its width) in configuration space. 

Now suppose the classical solution which we want to approximate is 



x, í) = V — == (afc/fc(x) exp(iw fe í) + «fc/fc(x) exp(-iw fc í)) , (40) 




and that it corresponds to the phase-space point (</>, 7r); then the corresponding 
coherent state is 

\C{^)) =\{D k {a k )\ü) (41) 

fc 

(the order in which the D k (ctk) are applied is irrelevant as they all commuto). 
Writing this out explicitly, we get 

\C(M) =n( cx P(-| Q fe| 2 /2)cxp(-a fe a t fc )) |fi) (42) 
fc 

which may equally wcll be written as 

|C(0, tt)) - exp í-i Ç \a k \^j exp (- Ç a k a{j \ü) . (43) 
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All we have used here is the elementary fact — applicable also to commuting 
operators — that a product of exponentials is equal to the exponcntial of the 
sum of their arguments. 

Now if we take (4>,ir) to be an clement of V satisfying J^k \ a k\ 2 = lj we can 
dcfinc 

k 

and 

D [4> ^){z) = cxp(-|z| 2 /2)cxp(zS^ 7r) ). (45) 

then we will have 

|C(^,7r))=S WiW) (l)|Q). (46) 

Acting on the vacuum with D^ ^{z) for higher vàlues of \z\ creates coherent 
states localized around successively larger wave-packcts; in this way, the action 
of D^^(z) on the vacuum as we vary \z\ and hold (</>,7r) fixed will map out 
a collection of states, whosc span is a subspacc of 7Ys- It is casy to see that 
this space can also be spanned by those states generated from the vacuum by 
successive action of the n operator. Structurally therc is a strong similarity 

to the subspaces created by the a\ , although of course this new subspace is not 
preserved by time evolution. 

5. 3 Coherent states are not partides 

Can the coherent states be regarded as quantum partides? Absolutely not. 
Although there are more and less localised non-relativistic quantum partides, 
and more and less localised coherent states, the two forms of localisation are 
wildly diffcrcnt. If (ep, tt) and (0', 7r') are phase-space points localised in diffcrcnt 
regions^] of £ then (<p> + </>', 7r + n') is a classical state which is non-localized in 
the sense that it is an extended field concentrated in two separated regions; if 
there are two spatially separated waves propagating on the surface of a pond 
then the excitations of that surface are in this sense nonlocal. And all coherent 
states are approximations to classical states: a coherent state formed around 
((/> + </>',7r + 7r') is non-localized only in the same sense as its classical progenitor. 

If ip and tp' are localised wave-packets of a quantum particlc, on the other 
hand, then ip + ip' is nonlocalized in a wildly different way. Though we may 
be able to regard ip and ip' as approximately classical (approximating classical 
point partides), we cannot so regard ip + ip' . After all, if the nonlocal nature of 
these wave-packets could be understood in the classical way then the profound 
foundational problems of quantum nonlocality would never have arisen. 

The coherent states offer us the possibility of constructing localised quantum 
states, but they are certainly not partides — localised or otherwise. 

8 Or even if just <f> and <f>' , or just ir and ir' , are localised in different regions. 
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5.4 Two linear structures 



The argument why coherent states will not do as partides can be summed 
up by noting that the following diagram does not commute: 



Localised 
classical 
states (</>„, 7r„) 



Classical superposition 



(E„ C n<P, E» °n^) 



General 
classical 
state (ó,tt) 



Quantum 

coherent 

approximation 



Localised 
coherent states 

|C(</>„, 7T„)) 



Quantum 
superposition 



Superposition 
of coherent states 

J2n C « |C(0„,7T„)) 



Quantum 

coherent 

approximation 



General 
coherent 
state \C((f), 7r)) 



However, the strategy of approximating the classical states gives us a good 
mcthod of constructing quantum states with given locality properties, and it 
would be uscful to preservo that strategy in finding candidatc particlc states. 
Clcarly, what is needed is a replaccmcnt of the 'quantum coherent approxima- 
tion' arrows in the diagram with some other approximation, which still preserves 
localisation but which leads to a commutative diagram. Such a replacement 
would have to be a far weaker concept of approximation than that applicable 
to coherent states, preserving little more than localization properties. This is 
to be expected, in fact — non-local particle states are highly non-classical, so 
cannot be good approximations to any classical field state. 

To find this replaccmcnt, consider the small-state limit of the coherent ap- 
proximation: that is, consider coherent approximations to some state (X(j), Xtt), 
as A — > 0. If (without loss of generality) we take the complex coordinates {a/c} 
of (0, 7r) to satisfy ^ fe \ak\ 2 = 1, thcn from ([45|), we find that such coherent 
approximations are given by 



\C(\<f>, Att)) = exp(-A 2 /2) exp(Aa^ )7r) ) |0) 



|n> + A|0,7r> 



where we have defined 

Now, if the classical state 
region A, then the state \cj), 



\ü) 



(47) 



(48) 



(4>, 7r) is non-zero (or just non-negligible) only in a 
7r) which we have just defined is, plausibly, effcctively 



23 



localised in A: for it is a linear combination of \C(4>, tt)} (which is a coherent 
approximation to (</>, 7r), and thus presumably localised in .4) and |Í2) (which, 
trivially, is localised everywhere). 

Of coursc, plausibility is at the moment all we have: as section 3.1 cxplaincd. 
effectivc localization is not in general a linear propcrty. But in the following 
section we will calculate the actual localization propcrties of \<fi, tt), and find 
that they are indced localized in the same region as the classical state. For the 
moment note that if such states are localised correctly then they are precisely 
what we are looking for — for if we define |fc) 
have (from |J) that 

|&7T>=5>*I*> 
fe 



4 \u) 



\f k /*J2w k ,0), we 
(49) 



From this it follows immcdiatcly that the |</>, tt) states form a subspace, and that 
the linear structure on that subspace mirrors the linear structure of the classical 
states being approximatcd: 



Aa 



t 



5.5 Localization properties of \4>, tt) states 



+ A'a],, ,,. 
(<t>',^') 



(50) 



To investigate the localisation properties of the \(f>, tt), we need to be able to 
calculate the expectation vàlues of produets of 0(x) and 7r(x) operators, and the 
major tool we shall use will be knowledge of the (easily-calculated) commutators 



and 



= i( 7 r(x)- ï (^/ 2 0)(x)). 



(51) 



(52) 



With these known, we can calculate expectation vàlues by the usual method of 
moving the annihilation operators over to the right where they annihilate |Í2). 
For instance: 

(0, tt| 0(x) 2 |0, tt) = (íï| S^0(x) 2 a^ |íï) (53) 



= (Í2|0(x)0(x) |fi) +2 fò(x),aj J f$(x),Sj )W 



(54) 



Subtracting off the vacuum expectation value (which is in general divergent, 
hence depends on whatever high-cnergy cutoff procedure we have chosen to 
use), we get 

(0, tt\ 0(x) 2 |0, tt) - 0(x) 2 |Í2) = 0(x) 2 + (^- 1 / 2 tt)(x) 2 . (55) 
In a similar way we can calculate 

(0, tt\ Tf(x) 2 |0, tt) - (íï| íf(x) 2 |íï) = tt(x) 2 + (^ 1 / 2 0)(x) 2 , (56) 
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M ^(x) 2 + ^(x) (K$) x) |0, tt) - (0| iíf(x) 2 + i^(x) (TZ4>) x) |íï) 



1 - 7 r 2 (x) + Íf^ 1 / 2 N ) 2 (x), (57) 



2 v ' 2 

ctc. (The last expectation value is that of the energy density, i. e. the (0, 0) 
component of the stress-energy tensor.) In each case — and, it is easy to see, 
for any such expectation value — the expectation value is some function of the 
classical fields cf>, ir, modified by the action of some fractional power of 1Z. (In 
particular, the energy density is equal to the classical energy density up to the 



action of such operators.) Given that, at the end of section 4.2. 1Z was required 
to satisfy either axiom 5b (which entails that all TZ X are L c -local for some L c ) 
or axiom 5a (which requircs this by fiat) it follows that 

• If (</>, 7r) is localised in a region A thcn the difference of expectation vàlues 
of |0,7r} falls off like exp(— d/L c ) with distance d from A. 



• Hence, by the definition of effective localisation (in section 3.2), if (</>, n) 
is localised in a region A then |0, ir) is effectively localised in the same 
region. 

• In vicw of the linearity of the map (0,7r) — > \4>,7t) which we have con- 
structed between classical and quantum states, it follows that the space 
of all states ir) obeys ELP on scale L c . 

Notc that (given the definition of L c -localised states) any structure that the 
classical state has on seales smaller than L c is likcly to be disrupted by the 
action of 7Z X ; in particular, for a classical state (0, tt) localised in a region small 
compared with L c , the corresponding quantum state |0,7r) will have little in 
common with the classical state other than being effectively indistinguishablc 
from the vacuum at distances from the classical state which are large compared 
with L r . 



5.6 Partides at last 



Let us review the process we have used to construct the \(f), tt) states. We 
have taken a classical wave-packet and constructed a coherent state around it. 
This state turns out to be expressible as the coherent state generated by a 
singlc creation operator, and in turn the action of that creation operator on the 
vacuum produces a state which is localised in the same region as the classical 
wave-packet (up to variations of size ^ L c ). 

It is now easy to see that the following diagram commutes: 
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Classical 
wave-packet 

(0(X),7T(X)) 



Classical superposition 



= Ea 



(afc/*(x) + «fe/fe (x)) 



fe 

i"( x ) = « Efe («fc/fc( x ) - "fc/fcW) 



Coherent 
quant um 
approximation 



Quantum 
coherent state 
|íï> 



Quantum combination 



D iM \n) = i\ k D k (a k )\n) 



Classical 
modes 
/fe(x) 



Coherent 
quant um 
approximation 



Quantum 
coherent modes 
D k \Sl) 



Restrict to 
action of 
first-order 
component 



of D, 



),7r) 



Restrict to 
action of 
first-order 
component 
of D k 



Localiscd 
particle state 


Quantum superposition 


Particle 
momentum 
eigenstates 

lfc)=4in) 


l<M) = Efe«fe \ k ) 



The important properties of the diagram are: 

1. Moving down the diagram preserves L c -localisation properties. 

2. Moving from the first to the second row takes us from the classical to the 
quantum regime, but does not drastically change the nature of the states: 
states in the second row are good approximations of those in the first row, 



in the sense spelled out in sections 5.1-5.2 This is not true for the third 
row: the only sense in which \<p, ir) approximatcs (</>, w) is that they share 
the same localisation properties. 

Moving leftward across the diagram corresponds to the combination of 
modes to make localised states. The middle (dashed) line is not a linear 
process, but the top and bottom lines both represent linear superposition. 
However, though mathematically very similar, these superpositions have 
physically very different meanings. 

In the quantum superposition process, it is natural to consider complex 
weightings for the states being superposed. The (mathematically) equiva- 
lent process at the classical level provides a generalisation of the real- linear 
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superposition process for classical states, effectively equipping the classical 
solution space with a complex structure (of which more later). 



With these results in hand, it is casy to vcrify that the space of \cj>, ir) states 



is indeed a one-particle space in the sense of section 3.2. The localised states 
are constructed by beginning with classically localised wave-packets and moving 
down the diagram. The requirement that all states in the space are superpo- 
sitions of localised ones follows from the equivalent property of classical phase 
space together with the commutativity of the diagram. The validity of the su- 
perposition principlc among effectively localised states is a trivial consequence 
of the diagram's commutativity. And, crucially, the closure of the one-particle 
subspacc under time-evolution follows once we observe that the \k) are energy 
eigenstates: this means that the projection operator onto the one-particle sub- 
space commutes with the Hamiltonian, so the subspacc must be preserved under 
time evolution. (Equivalcntly, closure under time evolution follows once we ob- 
serve that the map 

(0 j7 r)— >|&tt) (58) 

commutes with time evolution.) 

The essential property of the QFT which makes this whole process possible is 
its linearity: without the linearity, we would not have the classical linear struc- 
ture whose interplay with the linear structure of Tís allowed our construction 
to proceed. 

Hcnceforth, we will denote the space of all \cf>, ir) by Hip. 



6 Development of the particle concept 

In this section we will analyse further the construction of partides presented 
above. We will examine the importance of the Compton wavclcngth, and dc- 
velop the links between the linear structures on phase space and on Hilbert 
space; we will then use this analysis to give an alternative way of constructing 
the one-particle subspacc. 



6. 1 Significance of the Compton wavclcngth 



The results above imply that it is localisation on the scale of the Compton 
wavelcngth L c , and not exact localisation, that is significant for partides. Therc 
is a str aigh tforward physical rcason for the significance of L c : as mentioned in 
section 3.1, the vacuum state of any QFT is entanglcd (in the sense that field 
states in different spatial regions are entangled) and this entanglcmcnt cannot 
be removed from the non- vacuum states of the field without interfering with the 
field's structure at energy levels comparable to the cutoff energy (in other words, 
without going beyond the domain of validity of QFT). However, the correlations 
in the vacuum drop off with spatial distance, as can be seen from calculating 
quantities such as 
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(0| 0(x)0(y) |Í2) - (íï| $(x) |Í2) (íï| £(y) |Í2) 



= l -K- 1 'H{K-y). (59) 

If Hr 1 ! 2 is non- local on lengthscales of ~ L c , then we can treat spatial regions 
scparatcd by distanccs largc compared with L c as uncorrelated, but it makcs 
rathcr little sense to talk about localisation on seales small compared with L c . 

This also gives us at lcast heurístic grounds to extend the concept of the 
Compton wavelength beyond Euclidcan-invariant 1Z. As was shown in section 



4.1, the locality of 7Z requircs it to have forní 



(tt/)(x) = J2 c a (x)D a f(x), (60) 

\a\<N 

with Euclidcan-invariant 1Z corresponding to cach c Q being constant. Now, if 
we start with such a Euclidcan-invariant 1Z and introduce a very slow variation 
in its c a (with 'very slow' meaning 'significant variation on lengthscales much 
longcr than the Compton wavelength'), then we would expect the vacuum entan- 
glement lengthscalc to remain substantially unchanged, which in turn suggests 
that IZr 1 !" 1 would remain non-local on the same lengthscales. 

Of course, we are using physical intuition to conjecture results of a math- 
ematical nature, and this conjecture is ultimately no substitute for rigorous 
results about the operators Tir 1 / 2 in the non-Euclidean-invariant case. 

6.2 Field-particle duality 

The map 

(«M — HM (6i) 

defines a vector-space isomorphism[] between Tíip and the classical phase space 
V; since it commutes with time evolution, it also defines an isomorphism between 
Tíip and the classical solution space. We can use this isomorphism to pull 
back the Hilbcrt-space structurc (i. e. , the complex structure and the inner 
product) from Tíip to phase space, and henec to solution space: thus, we gain 
a prescription by which we can regard V as a Hilbert space 

It is instructive to give the precise forms for the complex structure (i. e. , 
the linear operator J corresponding to multiplication by i) and inner product 
((,)). on V . We will express each in three ways: 

9 Strictly speaking the map takes phase space only into a proper subset of Ti.±p, becausc 
the image of the map is not complete in the Hilbert-space norm on by t he latter (induced from 



Us)- Our relaxed attitude to this is due to our approach (in section |2.3Q to renormalisation: we 
will regard H.ip as being cut off at some (very high) energy, thus making it fïnite-dimensional 
and removing the problem. 

10 The existence of a complex structu re on the classical phase space has long been known, 



and in faci is a central part of Segal's (1964) approach to quantisation (briefly discussed in 
section B.2h. 
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1 . In tcrms of q k and p k : 

J(q 1 ,p 1 ;...;q k ,p k ;...) = (-p 1: q x \ . . . ; -p k , q k ; . . .); 

(( (4,, tt), (0', V) » = 1 2K«*9Í + P*í4) + ^kPk - Pkql)}. (62) 
fe 

2. In terms of a^: 

J(ai, . . . ,ah, ■ ■ .) = (iai, . . . , iafc, . . .); 

(<(^7r),(^y)))=X;«K· (63) 
fc 

3. Directly in terms of (4>,ir): 

J(<f>,ir) = (-TZ- 1/2 7r,TZ 1/2 (j)); 

({(M,tfy))) = l í d 3 x (^(x)^ 1 / 2 0'(x) + 4x)^- 1 /V(x)) 

+ 1 /" d 3 x(i(x)^'(x)^(x),'(x)), (64) 
J s 

or 

(( (0, tt), (0', tt') )) = (K^U + iK- 1/4 ir, TZ 1/4 ^ + iK- 1/4 tt') (65) 

where ( , ) is the ordinary L 2 inner product on S. 

From (2) it is easy to confirm that J and (( , )) are prcserved by time-evolution. 
The fractional powers of TZ which occur in (3) teli us that J and (( , )) are not 
strictly local. 

Since Tíip and V are isomorphic, we can use this Hilbcrt-spacc dcscrip- 
tion of V to provide a coordinatisation of 7i\p itself. This gives us a sort of 
wave-function description, albeit one in which the complex structure and inner 
product are not locally defined. Such a description is an expression, in a sense, of 
wave-particle duality, with the same mathematical description applicable to the 
one-particle subspace of the quantum system, and to the whole classical system. 
Note, though, that the duality is critically dependent upon the linear structure 
of the solution space — hence we have no reason to regard field-particle duality 
as a general property of ficld theories, but only of linear (or nearly linear) ones. 

The duality also implies that dynamics on phase space must be writeablc in 
Schròdingcr-equation form. Indeed, it is straightforward to check that Hamil- 
ton's equations 

4> = tt; tt = —1Z4> (66) 

are equivalent to 

^(<f>,ir) = -JK 1 / 2 (<f>,7r), (67) 
so that the Hamiltonian is the (mildly nonlocal) operator 1Z 1 / 2 . 
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6.3 Alternative construction of partides: heurístic form 



We havc argucd that it is the intcrplay between two Hilbcrt-space structures 
— the quantum-mechanical one and the one on the classical solution space — 
which lcads to the emergence of partides, but it is perhaps somewhat obscure 
exactly how that interplay comes about. In this section and the next we will 
describe an alternative way to the particle subspace which possibly gives more 
insight into this question. 

The basis of our new method is as follows: since the vacuum is significantly 
entangled only on lengthscales of order the Compton wavelength, we would 
expect that the action of a field observable like </>(x) on the vacuum would 
create a state differing from the vacuum only in the vicinity of x — that is, 
a state L c -localised at x. We might further expect that, if / and g are real 
functions which vanish outside some spatial region Si, then the state 

3 x (/(x)$(x)+ fl (x)SF(x)) |íï) (68) 

would be L c -localiscd in Ei. Furthermore, the space of all such states ( p8|) is 
spanned by states of form 4>{x) |Í2) and 7r(x) |Í2) — that is, by states which we 
expect to be L c -localised at a point. 

It should be stressed that it is by no means obvious that these 'expectations' 
will be confirmcd: the Rceh-Schlieder theorem reminds us that the link between 
the localisation properties of operators and of those states created by the action 
of those operators on the vacuum is rather subtle. Nonctheless, if they are 
confirmed then the space of states of the form ( |68[ ) satisfies our first two criteria 
(on page |Ï2|) for a particle subspace: that the ELP holds for the space, and that 
the space is spanned by a basis of localised states. (And in fact they can be 
confirmed.) 

No use has yet been made of the linearity of the classical solution space, 
so it is perhaps unsurprising that this property is essential to (amongst other 
things) ensure that our space is to satisfy the third criterion for a one-particle 
space: that the space is at least approximately preserved under time evolution. 
For the linearity of the classical field equations is equivalent to the requirement 
that the Hamiltonian is quadratic in the ficlds and thcir conjugate momentum, 
and this in turn entails that (writing U(t) = cxp(-iHt) for the time-translation 
operator), 

C/(í)0(x)C/ t (í) = 0(x) - it \l·l, 0(x)1 - t 2 \l·l, \h, 0(x)l 1 + . . . (69) 

is a linear combination of <p an d 7? operators — hence, the time evolution of 
a state like </>(x) |Í2) is a state of form (|Í38"|). This clearly entails the closure of 
the space of states of form (É38|) under time-evolution; hence, it is a one-particle 
space. 
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6.4 Alternative construction of partides: technical details 



Let us develop the formal details of this sketch. We begin in classical mc- 
chanics: recali that a classical observable is a real functional on the phase space 
(so in onc-particlc mcchanics the position observable assigns to cach phase-space 
point its configuration-spacc position, etc.) It will be important to prcserve, in 
the following, the distinction between the phase-space point (</>, 7r), which is a 
state of the classical field, and the classical observables </>(x) and 7r(x), which 
are functionals on the space of field states: the action of 0(x) on a field state 
returns the field strength at the point x.^| To make this distinction clear, in 
this section we will distinguish classical observables by writing thcm with a bar 
on top of them: </>(x), for instance. Thus, the observables </>(x) and 7r(x) are 
defined by 

WiKM] = (70) 

and 

^O[(0,7r)]=7r(x). (71) 

The space of observables in any classical-mcchanical system has a natural 
linear structure: {A + B)[v] := A[v] + B[v]. Furthcrmore, we can define time 
evolution for observables as follows: if, for any phase-space point v, v(í) denotes 
where that phase-space point has moved to aftcr time t, then we define 

A(t)[v] :=A[v(t)}. (72) 

If H is the classical Hamiltonian then we can write A(t) in the symbolic form 

A(t) =exp(-t{H,-})A, (73) 

(which is to be undcrstood as denoting the powcr-series expansion 

\n=0 ' / 

where {ff, •} A := jiï, A} .) This movement of the dynamics from the states to 
the observables is very similar, both conceptually and mathematically, to the 
move from the Schròdinger to the Hcisenberg picture in quantum mechanics. It 



is discussed in morc dctail by Woodhouse (1991, p.20 et seq. ) 



Although we gcncrally regard observables as real functionals, there is nothing 
to prevent us expanding the class of observables to inelude complex functionals, 
and we shall do so from here on. Any complex observable, of course, has the 
form A + iB, where A and B are real observables. 

So far, everything we have said about observables applies to any classical 
system. If, however (as in the case of linear field theory) the classical phase 
space has a linear structure which is preserved under time evolution, then it 

11 Similarly, in particlc mcchanics therc is a distinction between a particle's position x and 
the observables x 1 , which are functionals returning the ith component of a particle's position. 
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is possible to define real-linear observables as those real observables which are 
real-linear functionals; since the linear structure commutes with time evolution 
on phase space thcn it is easy to verify that it commutes with time evolution 
on the space of linear observables as wcll. In the spccific case of classical fields, 
the linear observables are precisely those of form 

í d 3 x (/(x)0(x)+. 9 (x)7f(x)) (75) 

with / and g real functions. We will refer to any such observable as L c -localiscd 
in a region Ei if the functions / and g are L c -localised in that region. 



Since, as we have seen (in section p.2| ), the classical phase space can be 
given the structure of a complex Hilbert space,Q we can also define complex, 
conjugate-linear observables in the obvious way (i. e. all those complex observ- 
ables which are real-linear and which satisfy C[J(<f>, n)] = — iC[(cf), 7r)].) It is 
easy to see that each such observable can be written as C = A + iB, with A 
and B real-linear observables. 

Now, it is a bàsic result of Hilbert space theory (the Riesz representation 
theorem) that the space of conjugate-linear functionals on a Hilbert space is 
isomorphic to that Hilbert space: in other words, that to each conjugate-linear 
functional A there corresponds a unique vector u such that, for any v, A[v] = 
(u, v). Hcnce, the space of complex conjugate-linear observables over the phase 
space of a classical linear field, conceived as a Hilbert space, is Hilbert-spacc- 
isomorphic to the phase space itself. This isomorphism takes the somewhat 
awkward form 

(4>,tt) < — ► J d 3 x ((ft 1/2 </>(x) - »tt(x))0(x) + (^ 1/2 tt(x) +i0(x))7f(x)) 

(76) 

from which we can see that it preserves L c -locality. It takes a far simpler form 
(albeit one in which its locality is obscured) if we use the ak coordinates for 
(0, 7r): if we define the observable a* k by a£[{aíi, . . . a n , ■ ■ ■}] '■= otk then the 
isomorphism is 

{a u . . .a n , . . .} < — >^2a k a" k . (77) 

k 

The move from classical to quantum mechanics can be thought of as an 
algorithm taking classical observables to quantum operators; it cannot be ap- 
plied to all observables but in field theory it docs apply to all of the real-linear 
observables, being generated by the maps 

0(x) — » 0(x) (78) 

and 

7f(x) — ► tt(x). (79) 

12 As was mentioned briefly beforc (in footnotc ^ on pagc ^) this is strictly correct only if 
we complete the classical phase space in the norm generated by its inner product. 
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The quantization map is linear, commutes with time evolution, preserves locality 
(trivially) and can be extended to complex conjugate-linear observables in the 
obvious way: A + iB — > A + iB. As might be expected, it can be written as 

ak7[ *k — y Yj akà k ■ (80) 
fe fe 

Wc have now moved from classical states, to classical observables, to quan- 
tum observables. Finally, wc move to quantum states by applying the appro- 
priate quantum observable to the vacuum state |fi). If the vacuum is entanglcd 
on lengthscales of L c , then we expect (though it is not a priori guaranteed) that 
this map, too, is i c -local. 

This sequence of three linear isomorphisms, each commuting with time evo- 
lution, can be summarised in the following diagram. 




Time evolution 



via Hamilton's equations 



Isomorphism 
between states 
and functionals 



Classical 
observable 



Quantization 



Quantum 
operator 
Efc a kà{ 



Apply to 
vacuum 



Quantum 
particle 



Time evolution 



via A 



{H,A} 



Unitary time-evolution 



(Heisenberg picture) 



Unitary time-evolution 



(Schròdinger picture) 



Classical 

state 

{...,a n e-^,...} 



Isomorphism 
between states 
and functionals 



Classical 
observable 



Quantization 



Quantum 
operator 

Efe"fe e 



iuJkt aï 



Apply to 
vacuum 



Quantum 
particle 
E fc " fc e-^*4|Q) 



The important properties of the diagram are: 
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• Each map on the diagram is linear; hence, the linear structure on classical 
phase space transfers to the one-particle subspace. 



• Nonetheless, the linear structures mean very different things: the lin- 
ear structures on the classical and quantum observables are conceptually 
closely related to one another, but are conceptually different from the lin- 
ear structure on the classical phase space, which in turn is conceptually 
different from the linear structure on the one-particle space. 

• The vertical maps preserve L c -localization; hence the one particle subspace 
satisfies the effective localisation principle on scale L c and is spanned by 
states which are L c -localiscd at a point. (This is true by definition between 
the first and second lines (cf . page |3^) , and trivial to show between the 
second and third lines. The substantive step is the one between the third 
and fourth lines — between quantum operators and quantum steps. It 



is made plausible by the observations of section 6.1 (that the vacuum is 



significantly entangled only on lengthscales of ~ L c ), and proved by the 



calculations of section 5.E 



• The diagram commutes; in other words, time evolution commutes with 
each horizontal map. This cnsurcs that states within the one-particle sub- 
space remain in that space, i. e. that particle states are taken to particle 
states under time evolution. 

Compared with the coherent-states method used earlier to construct parti- 
des, this new method has the advantagc of showing more clearly how the linear 
structure transfers from classical to quantum states; however, it provides much 
weaker reasons for believing that the map between classical and quantum states 
preserves L c -locality. 

(Of course, even in the coherent-state method, the argument given for parti- 
cle locality was only heurístic, relying on the assumption that if \C) is a coherent 
state L c -localised in some region, then \C) — |Í2) is L c -localised in the same re- 
gion. Ultimately, the only way to check L c -locality is by direct calculation.) 



7 The Newton-Wigner representation of Tíip 

In this section, we will consider the so-called 'Newton-Wigner' definition of 
local states, which we will generalise from Lorentz-invariant QFTs to the linear 
QFTs discussed above. 

7.1 Mathematical argument for the Newton-Wigner representation 

Although the representation of Tíip given in the previous section is a sort 
of configuration-space representation, it is somewhat awkward to use compared 
to the configuration-space representations which we are aceustomed to in non- 
relativistic quantum mechanics — essentially because of the non-local nature of 
the inner product and complex structure. 
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Purely for mathematical convenience, it would be useful to find a way of 
transforming our current representation into one where the inner product and 
complex structure are represented as local operations — such a transformation 
would incvitably have to bc nonlocal itsclf, but since we realise that the particle 
states are unavoidably mildly nonlocal (on seales of L c ) this is not problemàtic. 
Finding such a transformation is straightforward: it is given by 

{(/>, tt) — n 1/4 <f> - iK- 1/4 n, (81) 

or, in terms of the ctk coeficients, by 

|0,7r> = 5> fe |fc)^$> fc / fe (x). (82) 
fe fe 

It is straightforward to verify that in the new representation, the complex struc- 
ture is just multiplication by i and the inner product is the usual L 2 -product. 
Unlcss (0, 7r) is localised in a region small compared with L c thcn its locali- 
sation will not be significantly changed by the transformation; hence, the new 
representation is an equally aceurate representation of the actual localisation 
properties of the quantum states. In the ca sc of Klein-Gordon theory , this is 
known as the Newton- Wigner representation ( Newton and Wigncr 1949| ) and we 
will adopt this nomenclature for all such representations (i. e. , including QFTs 
other than Klein-Gordon theory). 

If we denote the transformation from the phasc-space representation of 7iip 
to the Newton- Wigner representation by Aí, then its complex-linearity is equiv- 
alent to the statement 

iN = NJ. (83) 
Applying Af to both sides of the Schròdinger equation (|67j), we get 

^-Af((j>, tt) = -NJK l ' 2 {(j>, tt) = -ÏR}' 2 N{(i>, tt) (84) 

so that the Hamiltonian in the Newton- Wigner representation is again TZ 1 ^ 2 . 
In the specific case of Klein-Gordon theory, we have 1Z 1 ! 2 = (to 2 — V 2 ) 1 / 2 , and 
in the non-relativistic limit this is approximately equal to m — V 2 /2m, so that 
we recover the non-relativistic free-particle Hamiltonian (up to an additive con- 
stant, the rest energy of the particle). It might appear that we have replaced a 
nonlocal operator with a local one, but for the nonrelativistic approximation to 
be vàlid we require that the wave-function contains only a negligible contribu- 
tion from eigenstates which don't satisfy |k| <C to — which in turn means that 
it must be localised in a region of size 3> L c . 

7.2 Conceptual significance of the Newton- Wigner representation 

There is another way to motivate the Newton- Wigner representation, based 
more on conceptual than mathematical grounds. In Dirac's formulation of non- 
relativistic quantum mechanics, the configuration space wave-function is intro- 
duced by 

V>(x) = (xM , (85) 
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where the states |x) are the eigenstates of the position operator — so that the 
integral of the wave-function over a small region gives the amplitude for the 
particle to be in that region. This fits well with our description of a one-particle 



space in section B.2 as a subspacc spanned by wcll-localiscd states (indeed, in the 
non-relativistic case such states can be arbitrarily well localised) and it would 
be useful to have a similar representation of QFT partides. 

To what extent does the Newton- Wigner representation provide this? The 
delta functions are certainly formally equivalent to position eigenstates, being 
perfectly localised in configuration space and forming an (improper) basis for 
the one-particle Hilbert space. But obviously they are not precisely localised in 
real space: if \^ínw) is the abstract ket corresponding to a delta-function at X, 
then we have 

|x w > = E/fc( x )l fc >> ( 86 ) 
k 

and it is easy to verify that (for instance) 

(x w | 0(y)0(y) |x w ) - (íï| 0(y)0(y) |fi) (87) 
is formally equal to 

if^/^x-y)] 2 , (88) 

and hence is localised only within a region of size ~ L c . 

This is not a problem, however. Recali that even in non-relativistic quantum 
mechanics, no particle can actually be placed in an eigenstate of position (such 
states are not even in Hilbert space). Rather, such states are to be seen as 
an idealised mathematical limit of a sequence of successively better-localised 



states (see, e. g. , Cohen-Tannoudji, Diu, and Laloè (1977, pp. 100-105)). We 



can apply the same approach here, with the proviso that the successive terms of 
this sequence are only successively better localised in physical space (as opposed 
to in Newton- Wigner configuration space) up to a point: the point at which the 
terms are localised in regions which are not large in comparison with L c . After 
that point the states will remain localised in a region of size ~ L c irrespective 
of how localised the Newton- Wigner wavefunction is. But providing that the 
wavefunctions which we arc studying do not thcmsclves vary on lengthscalcs 
comparable to L c , we will only necd thosc terms in the sequence which are large 
compared with L c — hence, we may effectively use the concept of 'position 
eigenstate'. 

Providing the Newton- Wigner wavefunction varies slowly on lengthscales of 
size L c , we can regard the integral of the function over regions of this size as 
giving the amplitude to find it in a state effectively localised in the region. This 
then allows us to define an effective position operator: 



d J x \yL NW ) (*nw\ ■ (89) 



This operator will perform the same task as the non-relativistic position op- 
erator, provided that we are not interested on seales small compared with the 
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minimum localisation scale: its expectation value will give the expected value of 
the particle's position upon measurement, and projections built from its spec- 
trum correspond to projections onto the particle being in a given region (always 
assuming that region to be largc comparcd with the minimum localisation scale 
L c ). 

However, the relationship between this position opcrator and the Newton- 
Wigner position states is opposite from that which holds between non-relativistic 
position operators and position eigenstates (at least in some presentations). In 
the latter case, the position operator is our starting point and position eigen- 
states are position eigenstates because they are eigenstates of the position op- 
erator. In QFT, the Newton- Wigner position states are approximately local 
because of their relationship with the field observables, and the position opera- 
tor is constructed from thcm. 



7.3 "Pathological" features of the Newton- Wigner representation 



The Newton- Wigner representation of Klcin-Gordon partides is wcll known 
to have two apparently pathological features: eigenstates of position do not re- 
main eigenstates under Lorentz transformations, and Newton- Wigner wavcfunc- 
tions spread out superluminally. From the current perspective, both properties 
are curiosities rather than pathologies. The Newton- Wigner representation is 
not covariantly defined, so there is no mathematical reason to expect localisa- 
tion to be totally unaffected by Lorentz boosts — but the boosts do not cause 
one-particle states which are effectively localised in a given region to stop being 
localised in that region. Similarly, Newton- Wigner wave-functions do spread 
faster than light — but the transformation between phase space V and the 
Newton- Wigner representation commutes with time evolution, so the Newton- 
Wigner state continues to represent a state effectively localised in the region 
occupied by a classical state which is propagating at subluminal speeds. 

As such, the Newton- Wigner representation is a perfectly legitimate, and 
often very convenient, way of describing states in the one-particle subspace 
- but it doesn't give the exact truth of the matter as to where partides are 
localised, because there isn't one: partides are superpositions of field excitations 
with finite size, so any attempt to give a wave-function description down to 
arbitrarily small seales is inevitably going to be itself arbitrary at those seales. 

Bcfore ending this discussion, I should note that there is an alternative tradi- 
tion (going back to ( Bcgal 1964 ), and currently defended by Fleming (1996| )) that 
takes Newton- Wigner-localised states as by definition localised. In this view- 
point, localisation of states is defined directly in terms of the Newton- Wigner 
position operator, rather than (as here) via the localisation properties of the 
operator àlgebra. Such an approach, by taking the Newton- Wigner position op- 
erator as having fundamcntal significance, must of course confront the apparent 
pathologies of the Newton- Wigner representation. 

This paper, however, has as a starting assumption that localisation is defined 
via the operator àlgebra, and as such I will not discuss Fleming's approach. The 
reader interested in how (and if) the alternative tradition can come to terms 
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with the pathologies should see Fleming and Buttcrficld (1999|) as wel l as the 
recent exchange of views between Fleming (2000| ) and Halvorson (2001). 



8 Comparisons with other methods 

This section is a brief comparison of the results of this paper with two 
other approaches to quantum field theory: the collision theory dcveloped in 
the framework of algebraic QFT by Haag and Ruelle, and Segal's approach to 
quantizing linear systems. 



.1 Comparison with Haag-Ruelle collision theory 



Haag-Rucllc theory (discussed in Haag (1996 ), and references therein) is an 



analysis of collision theory, and of the particle content of quantum ficlds, within 
the framework of algebraic QFT. Conceptually speaking this is esscntially the 
same framework used here (although treated with significantly more mathemat- 
ical caref^l) : the field is taken as primary and partides are regarded as emergent 
concepts; thus, this paper's approach is complementary to, and not in conflict 
with, Haag-Ruelle theory. 

A detailed description of the theory lies beyond the scope of this paper, 
but esscntially it applics to any Poincaré-invariant QFT which has a discrete 
eigenvalue of the mass observable (other than zero) and for which the mass 
spectrum of the subspace orthogonal to the vacuum has a lower bound away 
from zero (so that it implies, for instance to massivc Klcin-Gordon theory). And 
it establishes that at asymptotically early and late times, 

1. The QFT behaves as a free, massive QFT; 

2. Any state which lies in the iV-particle sector of that free theory can set 
off at most N widely separated detectors. 

Note that the theory applies to interacting as well as to free theories; how- 
ever, it only applies at asymptotically early and late times, and implies that any 
QFT is effectively free at such times; thus, it entails a particle interpretation 
only for effectively free fields (which is the case described above) . Its virtue over 
the approach of this paper is its generality (it applics to fermionic as well as 
bosonic fields, and to composite as well as 'elementary' partides) and its abil- 
ity to infer the particle propertics of the field directly from its mass spectrum, 
without recourse to concepts of field-particle duality. However, for this very 
reason the close relationship between classical solutions and one-particle states 
is somewhat obscure in the Haag-Ruelle analysis. 

At first sight, the generality of the Haag-Rucllc analysis might seem to under- 



mine the argument of section S.2 that the particle concept is esscntially bound 
up with the lincarity of the field theory in question. However, the analysis itself 
telis us that the existence of a massivc asymptotic limit of a field theory, and 
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See Wallacc (2001b) for a discussion of the relationship between algebraic QFT and the 



prima facie less rigorous QFT described here and used in mainstream physics. 



38 



the existence of a discrete nonzero mass, are really equivalent statements. In 
the context of the present paper, the Haag-Ruellc analysis should remind us 
that the linear theories we are studying should be regarded as effective, asymp- 
totic limits of intcracting ficld theories, and not as fundamental ficlds in thcir 
own right: for instance (though tcchnically it lies rather beyond the scope of 
this paper sincc it deals with fermionic partides) we would expect QCD to be 
asymptotically analysablc in terms of 'proton' and 'neutron' fields, but these 
bear little resemblance to the fundamental — and strongly interacting — quark 
fields of QCD. 

(Technically, the analysis of this paper might seem more general than the 
Haag-Ruelle theory since the latter is confined to Poincaré-invariant QFTs. 
However, we can extend Haag-Ruelle theory to general translation-invariant 
QFTs by generalising its spectral condition to require that the 4-momentum 
spectrum includes a discrete surface in 4-momentum space which nowhcrc in- 
tersects the origin (we get this for free by Lorentz invariance in a relativistic 
theory, once we havc spccificd a discrete mass cigcnstate). It is at best unclcar 
how to apply Haag-Ruelle theory to QFTs without translational symmetry; it 
should, though, be acknowledged that the above analysis is also not entirely sat- 
isfactory in such situations as we have only a heuristic argument for the approx- 
imate locality of non-translation-invariant 1Z X operators — recali the discussion 



of section 6.1.) 

8.2 Comparison with Segal quantization 

It is instructive to compare the field-quantization approach given above with 



the quantization program developed by Segal (1964) and others (see Saunders 



(1992) for a foundational discussion). In Segal's approach, we begin by choosing 
a complex structure J on the phase space of a linear field,|^ which we require to 
have certain properties (spccifically, for any states u, v, we require J to satisfy 
Í2( Ju 1 Jv) = Í2(ií, v), and Q( Ju, u) > 0, where n(-, •) is the symplectic form on 
V). We then use J to define an inner product on V , given by 

({u,v)) = ü(Ju,v) -iü(u,v). (90) 

This (following norm-complction) makcs V into a Hilbcrt space (the 'one-particle 
Hilbert space'), on which we can easily show that the classical dynamics are 
unitary. The Fock space generated from that Hilbcrt space is then takcn to be 
the Hilbert space of the QFT, and there is a very natural dcfinition of the action 
of the ficld operators upon that space. 

This is effectively the reverse procedure to ours. In Segal's approach, we 
begin with the Hilbert-space structure on phase-space, declare it to be the one- 

14 There is another approach very closely related to Segal quantization, in which instead 
of choosing a complex structure, we complexify the phase space and then choose a subspacc 
of (formally) half the dimension of the complexified phase space. This process is at least for- 
mally equivalent to choosing a c omplex struc ture. though there are some infinite-dimensional 



tcchnicalities to take care of: see Wald (1994) for an exposition of this method, and especially 
p. 46 of his book for the relationship with the Segal method. 
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particle Hilbert space, and then build the full QFT Hilbert space TLs from that 
structure; in ours, we begin with Tís, identify a subspace which deserves to be 
called the 'one-particle' space, and then use it to give a Hilbert-space structure 
to V. 

This highlights an important conceptual diffcrcncc bctwcen Segal quantiza- 
tion and the 'canonical' method of quantization used here. In our approach, the 
mcthod used to quantize a (bosonic) ficld theory is csscntially the same irrespec- 
tive of whether that theory is linear: throw the theory into Hamiltonian form 
and look for a map from field observables to operators on some Hilbert space, 
such that Poisson brackets become commutators. Then if the classical theory 
is linear, analysis of the quantum theory reveals the one-particle subspace. In 
Segal's approach, on the other hand, the linearity is fundamental to the entire 
quantization algorithm, and it is most unclear how nonlinear field theories are 
to be accommodated. 

The attitude which one may take towards this difference depends on one's 
confidcncc in the mathcmatical and conceptual status of nonlinear QFTs - 
in particular, the status of the infinitics which arise in the dynamics of such 
theories, and of the rcnormalisation process used to tame them. If (with the 
author; sec Wallacc 2001b) one takes a relaxed attitude to thcse infinities, then 
the Segal process becomes an elegant curiosity: a nice way to understand linear 
quantization, but of little relevance to general QFTs. If, on the other hand, 



one regards the infinities as a pathological problem in QFT, then (as Saunders 



(1992) has argued) it may be a virtue of the Segal approach that it forces us 



to look for radical ways of reformulating the quantization of nonlinear ficld 
theories. 

It is also possible to see Segal quantization as pointing to a different view 
of the relationship betwecn field and particle than the one defended here. As 
mcntioncd above, in Segal quantization it is necessary to begin by choosing a 
complex structure on the classical phase space, and different choices of structure 
lead to one-particle spaces with different properties. If one believes that there 
is a 'right' choice of complex structure, then there is also a 'right' one-particle 
space, and that space is a fundamental building block when assembling the field 
Hilbert space; this is at least suggestive of a more even relationship between 
field and particle concepts than the field-is-primary viewpoint advocated above. 

On the other hand, it is also reasonable to regard the choice of a complex 
structure as just being a matter of taste, in which case the importance of the 
one-particle subspace is downgraded. One argument in favour of this attitude 
is that in general it is very unclear what rule could pick out the 'right' complex 
structure: there is a unique prescription in the case of a time-independent linear 
QFT (this includes the QFTs considered above) but in, say, the presence of a 
time-dependent potential, or in a curved spacetime, then there does not appear 
to be any general rule by which one complex structure could be preferred over 
another. This is, of course, a variant of the last of the arguments which (in 
section 2.4) led us to regard field as primary in the first place. 



40 



9 Conclusion 



We have shown that the one-particle subspace of a free, bosonic, massive 
quantum field emerges very naturally as a consequence of that field's classical 
linearity, and that the interplay between classical and quantum linear structures 
is such as to give those particle states the right sort of locality properties. 

It is important to remember that this sort of construction of particle spaces 
is not intended to be any sort of explanation as to why the world around us is 
observed to be particulate. At best it is able to show how such obscrvations are 
compatible with the validity of QFT — the question of why the actual world 
appears to be in a particulate state is much subtler. It presumably requircs 



considcration of decohcrence theory (see Anglin and Zurck (1996 ) for some ideas 



along these lines), and more generally it involves the measurement problcm. 

It is also important to keep in mind the limitations on our construction. Al- 
though these results should extend straightforwardly to multi-componcnt fields, 
they do not apply to the important cases of fermionic, and massless, fields. In 
the former case we would expect the particle concept to be more, not less, impor- 
tant, due to the Pauli exclusion principle; but the fact that spacelike-separated 
Fermi field operators anticommute rather than commute suggests that we can 
no longer visualise the QFT Hilbcrt spacc in tcrms of wave-functionals on con- 
figuration space. In the case of massless fields, we expect the particle concept to 
be much more subtlc:^| QED, for instancc, possesses not only regimes in which 
the electromagnètic field is analyzablc in tcrms of frec photons, but regimes in 
which it should be thought of as a classical field, and othcrs in which it simply 
provides a long-range force between nonrelativistic partides. 

This brings up pcrhaps the most substantial limitation of this paper's ap- 
proach to partides — and indeed of any approach which analyses partides in 
terms of asymptotic behaviour. Although it may well be the case that there is 
no particulate description of high-energy phenomena except at asymptotically 
early and late times, atòmic and solid-state physics — not to mcntion biology 
and chemistry — providc us with a wealth of situations in which 'partides' 
(spccifically, electrons and atòmic nuclei) are strongly interacting and yet still 
maintain their own particle character. Analysing this situation would require a 
deep understanding of the nonrelativistic limit of massive and massless QFTs, 
and lies far beyond this paper's scope. 

Nonetheless, despite the limitations of this approach it is satisfying to find 
that ficld-particlc duality can be understood in the context of a field ontology, 
and intriguing to observe the elegant and somewhat subtle ways in which the 
various linear and complex structures present intertwine to make this under- 
standing possible. 
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This is supported by the situation in algcbraic QFT, in which dcvclopmcnt of a satisf actori 



collisi on th corv for fields w ithout a mass gap is at a much more rudimentary stage; see Haaj 
(1996[ ) and Buchholz (200C) for recent discussions. 
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